Laboratory Report: Human-supervised and fully-automatic formant-trajectory measurement for forensic voice comparison – Female voices
نویسندگان
چکیده
Acoustic-phonetic approaches to forensic voice comparison often include analysis of vowel formants. Such methods typically depend on human-supervised formant measurement, which is often assumed to be relatively reliable and relatively robust to telephonetransmission-channel effects, but which requires substantial investment of human labor. Fully-automatic formant trackers require minimal human labor but are usually not considered reliable. This study assesses the effect of variability within three sets of formant-trajectory measurements made by four human supervisors on the validity and reliability of forensic-voice-comparison systems in a high-quality v high-quality recording condition. Measurements were made of the formant trajectories of /iau/ tokens in a database of recordings of 60 female speakers of Chinese. The study also assesses the validity of forensic-voice-comparison systems including a human-supervised and five fully-automatic formant trackers under landline-to-landline, mobile-to-mobile, and mobile-to-landline conditions, each of these matched with the same condition and mismatched with the highFVC, EE&T, UNSW – Laboratory Report 2 quality condition. In each case the formant-trajectory systems were fused with a baseline mel-frequency cepstral-coefficient (MFCC) system, and performance was assessed relative to the baseline system. The human-supervised systems always outperformed the fullyautomatic formant-tracker systems, but in some conditions the improvement was marginal and the cost of human-supervised formant-trajectory measurement probably not warranted.
منابع مشابه
Forensic voice comparison with monophthongal formant trajectories - a likelihood ratio-based discrimination of "schwa" vowel acoustics in a close social group of young Australian females
An experiment is described relating to estimation of strength of evidence in likelihood ratio-based forensic voice comparison. It is asked whether a better performance is obtained from point estimation of formant pattern targets in monophthongal vowel acoustics rather than formant trajectories. The hypothesis is tested on non-contemporaneous recordings of a custom-built challenging database of ...
متن کاملLikelihood Ratio Calculation in Acoustic-Phonetic Forensic Voice Comparison: Comparison of Three Statistical Modelling Approaches
This study compares three statistical models used to calculate likelihood ratios in acoustic-phonetic forensic-voicecomparison systems: Multivariate kernel density, principal component analysis kernel density, and a multivariate normal model. The data were coefficient values obtained from discrete cosine transforms fitted to human-supervised formant-trajectory measurements of tokens of /iau/ fr...
متن کاملA first attempt at compensating for effects due to recording-condition mismatch in formant-trajectory-based forensic voice comparison
This paper reports the results of a first attempt at compensating for variability in formant-trajectory representations due to differences in suspect and offender recording conditions. Formant-trajectory measurements were made on tokens of /iau/ in a database of high-quality and mobile-to-landline-transmitted recordings of 60 female speakers of Chinese. Discrete cosine transforms (DCT) were fit...
متن کاملAn acoustic comparison of alto and tenor voices
Spectra of vowels sung a t identical pitches by two tenor voices and two alto voices a r e compared. The formant frequencies and the source spectra a r e studied by matching the spectra on a terminal analogue of the vocal tract. As regards the source spectrum the alto voices show a higher relative amplitude of the fundamental than the tenor voices. As regards the formants, ari apparent differen...
متن کاملForensic Voice Comparison Using Chinese /iau/
An acoustic-phonetic forensic-voice-comparison system extracted information from the formant trajectories of tokens of Standard Chinese /iau/. When this information was added to a generic automatic forensic-voice-comparison system, which did not itself exploit acoustic-phonetic information, there was a substantial improvement in system validity but a decline in system reliability.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012